077e29b11be80ab57e1a2ecabb7da330-Reviews.html

Neural Information Processing Systems

This paper studies a mini-batch gradient method for dual coordinate ascent. The idea is simple: at each iteration, randomly pick m samples and update the gradient. The authors prove that the convergence rate of the mini-batch method interpolates between those of SDCA and AGD; in certain circumstances it can be faster than both. I am a little surprised that, in the case gamma*lambda*n = O(1), the number of examples processed by ASDCA is n*\sqrt{m}, which means that under full parallelization, m machines give an acceleration rate of \sqrt{m}.
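The mini-batch scheme the review describes ("randomly pick m samples and update") can be sketched in a few lines. Below is a minimal, non-accelerated version for ridge regression; it is an illustrative stand-in, not the paper's ASDCA method (which adds an acceleration/momentum sequence), and the function names and the closed-form squared-loss update are my own choices.

```python
import numpy as np

def minibatch_sdca_ridge(X, y, lam, m, epochs, seed=0):
    """Plain (non-accelerated) mini-batch SDCA sketch for ridge regression:
    min_w (1/n) * sum_i 0.5*(w.x_i - y_i)^2 + (lam/2)*||w||^2.
    Each step draws m dual coordinates at random and updates them; the
    primal iterate is maintained as w = (1/(lam*n)) * X^T alpha."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    alpha = np.zeros(n)            # dual variables, one per example
    w = np.zeros(d)                # primal iterate
    sq_norms = (X ** 2).sum(axis=1)
    for _ in range(epochs * max(n // m, 1)):
        batch = rng.choice(n, size=m, replace=False)
        # Updates are applied sequentially here; a parallel implementation
        # would compute all m updates from the same w and damp the combined step.
        for i in batch:
            # closed-form dual coordinate maximization for squared loss
            step = (y[i] - X[i] @ w - alpha[i]) / (1.0 + sq_norms[i] / (lam * n))
            alpha[i] += step
            w += step * X[i] / (lam * n)
    return w

def ridge_objective(X, y, w, lam):
    return 0.5 * np.mean((X @ w - y) ** 2) + 0.5 * lam * (w @ w)
```

On a small synthetic regression problem, a few passes of this sketch drive the regularized objective well below its value at w = 0.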




Accelerated Mini-Batch Stochastic Dual Coordinate Ascent

Shai Shalev-Shwartz, Tong Zhang

Neural Information Processing Systems

Stochastic dual coordinate ascent (SDCA) is an effective technique for solving regularized loss minimization problems in machine learning. This paper considers an extension of SDCA under the mini-batch setting that is often used in practice. Our main contribution is to introduce an accelerated mini-batch version of SDCA and prove a fast convergence rate for this method. We discuss an implementation of our method over a parallel computing system, and compare the results to both the vanilla stochastic dual coordinate ascent and to the accelerated deterministic gradient descent method of Nesterov [2007].



Communication-Efficient Distributed Dual Coordinate Ascent

Jaggi, Martin, Smith, Virginia, Takac, Martin, Terhorst, Jonathan, Krishnan, Sanjay, Hofmann, Thomas, Jordan, Michael I.

Neural Information Processing Systems

Communication remains the most significant bottleneck in the performance of distributed optimization algorithms for large-scale machine learning. In this paper, we propose a communication-efficient framework, COCOA, that uses local computation in a primal-dual setting to dramatically reduce the amount of necessary communication. We provide a strong convergence rate analysis for this class of algorithms, as well as experiments on real-world distributed datasets with implementations in Spark. In our experiments, we find that as compared to state-of-the-art mini-batch versions of SGD and SDCA algorithms, COCOA converges to the same .001-accurate …
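The local-computation idea behind COCOA can be illustrated with a small single-process sketch: each of K workers runs dual coordinate steps on its own partition of the data against a frozen copy of the primal iterate, and only the resulting updates are aggregated once per outer round. This is a simplified stand-in for the paper's framework, not its Spark implementation; ridge regression, the closed-form squared-loss update, and the 1/K averaging (the conservative aggregation variant) are my own illustrative choices.

```python
import numpy as np

def cocoa_ridge(X, y, lam, K, outer_rounds, local_steps, seed=0):
    """Simplified, single-process COCOA-style solver for ridge regression:
    min_w (1/n) * sum_i 0.5*(w.x_i - y_i)^2 + (lam/2)*||w||^2.
    Dual coordinates are partitioned across K workers; each worker runs
    `local_steps` SDCA updates against a frozen copy of w, and the local
    primal/dual updates are averaged (scaled by 1/K) in one aggregation
    step per outer round, i.e., one communication round."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    parts = np.array_split(rng.permutation(n), K)  # disjoint coordinate blocks
    alpha = np.zeros(n)            # dual variables
    w = np.zeros(d)                # primal iterate, w = (1/(lam*n)) * X^T alpha
    sq_norms = (X ** 2).sum(axis=1)
    for _ in range(outer_rounds):
        d_alpha = np.zeros(n)
        d_w = np.zeros(d)
        for part in parts:         # in a real system these loops run in parallel
            w_local = w.copy()
            a_local = alpha.copy()
            for _ in range(local_steps):
                i = rng.choice(part)
                # closed-form dual coordinate maximization for squared loss
                step = (y[i] - X[i] @ w_local - a_local[i]) / (1.0 + sq_norms[i] / (lam * n))
                a_local[i] += step
                w_local += step * X[i] / (lam * n)
                d_alpha[i] = a_local[i] - alpha[i]
            d_w += w_local - w
        # single "communication" step: average the workers' updates,
        # which preserves the primal-dual relation between w and alpha
        alpha += d_alpha / K
        w += d_w / K
    return w
```

Because each outer round costs one aggregation regardless of `local_steps`, increasing the local work trades extra computation for fewer communication rounds, which is the trade-off the abstract describes.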


Straggler-Agnostic and Communication-Efficient Distributed Primal-Dual Algorithm for High-Dimensional Data Mining

Huo, Zhouyuan, Huang, Heng

arXiv.org Machine Learning

Recently, reducing the communication time between machines has become the main focus of distributed data mining. Previous methods propose to have workers do more computation locally before aggregating local solutions at the server, so that fewer communication rounds between server and workers are required. However, these methods do not consider reducing the communication time per round, and they perform poorly under certain conditions, for example when there are stragglers or the dataset is high-dimensional. In this paper, we aim to reduce the communication time per round as well as the number of required communication rounds. We propose a communication-efficient distributed primal-dual method with a straggler-agnostic server and bandwidth-efficient workers. We analyze the convergence properties and prove that the proposed method guarantees a linear convergence rate to the optimal solution for convex problems. Finally, we conduct large-scale experiments on simulated and real distributed systems, and the experimental results demonstrate that the proposed method is much faster than the compared methods. Distributed optimization methods are nontrivial when the data or model is distributed across multiple machines. When data are distributed, parameter-server [6], [14] or decentralized methods [15], [16] have been proposed for parallel computation and linear speedup.